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Mutation Detection Using MutS and RecA 

BACKGROUND OF THE INVENTION 

Field of the Invention 

Tlie present invention in the fields of molecular biology and medicine relates to a metliod 
for detecting mutations and polymorphisms involving as little as one base change (Single 
Nucleotide Polymorphism - SNP) or a single base addition to or deletion from the wild-type 
DNA sequence. 

Description of the Background Art 

Progress in human molecular and medical genetics depends on the eflScient and accurate 
detection of mutations and sequence polymorphisms, the vast majority of which results from 
single base substitutions (Single Nucleotide Polymoqjhisms - SNP) and small additions or 
deletions. Assays capable of detecting the presence of a particular mutation, SNP or mutant 
nucleic acid sequence in a sample are therefore of substantial importance in tihie prediction and 
diagnosis of disease, forensic medicine, epidemiology and public health. Such assays can be 
used, for example, to detect the presence of a mutant gene in an individual, allowing 
determination of the probability that the individual will suffer from a genetic disease. The ability 
to detect a mutation has taken on increasing importance in early detection of cancer or discovery 
of susceptibility to cancer with the discovery that discrete mutations in cellular oncogenes can 
result in activation of that oncogene leading to the transformation of that cell into a cancer cell 
and that mutations inactivating tumor suppressor genes are required steps in the process of 
tumorigenesis The detection of SNPs has assumed increased importance in the identification and 
localization (mapping) of genes, including those associated with human and animal diseases. 

The desire to increase the utility and applicability of such assays is often frustrated by 
assay sensitivity as well as complexity and cost Therefore, it would be highly desirable to 
develop more sensitive, simple and relatively inexpensive assays for detection of alterations in 
DNA. 

Nucleic acid detection assays can be based on any of a number of characteristics of a 
nucleic acid molecule, such as its size, sequence, susceptibility to cleavage by restriction 
endonucleases, etc. The sensitivity of such assays may be increased by altering the manner in 
which detection is reported or signaled to the observer. Thus, for example, assay sensitivity can 
be increased through flie use of detectably labeled reagents such as enzymes (Kourilsky et ai, 
U.S. Pat 4,581,333), radioisotopes (Falkow etal, U.S. Pat 4,358,535; Beminger, U.S. Pat 

I 
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4,446^37), fluorescent labels (Albarella et al., EP 144914), chemical labels (Sheldon HI et al., 
U.S. Pat. 4,582,789; Albarella et al, U.S. Pat 4,563.417), modified bases (Miyoshi et al., EP 
1 19448), and the like. 

Most methods devised to attempt to detect genetic alterations consisting of one or a few 
5 bases involve amplification of specific DNA regions by PGR and many involve hybridization 
between a standard nucleic acid (DNA or KNA) and a test DNA such that the mutation is 
revealed as a mispaired or unpaired base in a heteroduplex molecule. Detection of these 
mispaired or unpaired bases has been accomplished by a variety of methods (Myers, RM, et al.. 
Cold Spnng Harbor Symp. Quant. Biol. 57:275-284 (1986); Gibbs, R. et al.. Science 236:303-305 

10 (1987); Lu, AS. et al., 1992, Genomics 14:249-255 (1992); Cotton, RG, et al., Proc. Natl. Acad. 
Sci. USA 55:4397-4401 (1988); Cotton, RG, iV«c/. Acids Res 77:4223-4233 (1989); Lishanski, A. 
et al., Proc. Natl. Acad. Sci. USA Pi :2674-2678 (1994); Wagner, RE, et al, Nucl Acids Res. 
25:3944-3948 (1995); Debbie, P. et al, Nucl Acids Res 25:4825-4829 (1997). These methods all 
suffer &om the requirements that: (1) a specific DNA region must be amplified by polymerase 

15 chain reaction (PGR) prior to mutation or SNP detection and (2) amplified DNA must be 

denatured and allowed to aimeal with some standard DNA of known genotype in order to allow 
the formation of mismatches or unpaired bases. PGR amplification introduces errors during 
amplification (it is a relatively low-fidelity process) and denaturation and annealing, particularly 
of genomic DNA or of large amplicons, can leave large, un-annealed single stranded firagments 

20 which can adopt secondary stmctures containing regions of double stranded DNA with unpaired 
or mispaired bases. These mismatched and unpaired bases can interfere with the detection of the 
target mismatches or unpaired bases. Many of these methods also require that the exact location 
of the mutation be known and are difficult to interpret when the sample DNA is heterozygous for 
the mutation in question. Therefore, most are not practical for use in screening for mutations and 

25 SNPs. 

MutS and RecA are bacterial proteins involved in DNA repair and genetic recombination 
and have been best characterized in E. coli. MutS is the mismatch recognition and binding 
protein of the E, coli mismatch repair system^ which fimctions to repair errors made by DNA 
polymerase during DNA replication. The system also recognizes mismatches in the hybrid 
30 overlaps created in the initial steps of genetic recombination and acts on such nnsmatch 

containing regions to abort recombination. Thus, the mismatch repair system is an editor both in 
DNA replication and genetic recombination and assures high fidelity in both processes. (The 
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editing of recombination is essential to avoid chromosomal rearrangements, to allow successfiil 
meiosis and to erect a genetic barrier between closely related species.) 

MutS has been used in mutation and SNP detection (Lishanski et al, supra; Wagner et 
al, supra; Debbie et al. supra; patents by Wagner and colleagues - U.S. 6,027,877, 6,1 14,1 15, 
6,120,992, 6,329,147; and Gifford U.S. Pat. 5,750,335). When used in solution, as in filter 
binding or gel shift assays (Jiricny et al., Nucl Acids Res. 7(^:7843-7853 (1988); Lishanski et al., 
supra) MutS performs poorly, in that it does not detect most mismatches and exhibits high levels 
of background binding of non-mismatched DNA. Innnobilized MutS has a greatly increased 
ability to bind mismatches and a greatly diminished ability to bind DNA without mismatches 
(Wagner et al, supra). However, even immobilized MutS suffers from PCR induced errors in 
DNA, including both misincoiporation and mispriming, and single stranded DNA is a powerful 
competitor in mutation and SNP detection assays. 

RecA, a bacterial recombinase which has been best characterized in E. coli, is the key 
player in the process of genetic recombination, in particular in the search and recognition of 
sequence homology, and the initial strand exchange process. RecA can catalyze strand exchange 
in the test tube. Recombination is initiated when multiple RecA molecules coat a stretch of single 
stranded DNA (ssDNA) to form what is known as a RecA "filament." This filament, in the 
presence of ATP, searches for homologous sequences in double stranded DNA (dsDNA). When 
homology is located, a tiiree stranded (D-Ioop). structure is fonned wherein flie RecA filament 
DNA is paired with the complementary strand of the duplex. If pairing is not perfect, i.e.. if there 
are mismatches or unpaired bases m the newly created duplex, MutS can bind to these sttuctures 
and mobilize the other proteins of the mismatch repair system which act to abort the 
recombination event by removing the filament DNA and restoring flie original duplex. 
Considerable evidence (most of it still unpublished) suggests that RecA and MutS co-localize 
during recombination and that RecA binding to DNA may fecilitate MutS mismatch recognition, 
perhaps by improving flie presentation of mismatches to MutS. 

RecA has been used to feciUtate screening of plasmid libraries for plasmids containing 
specific sequences (Rigas et al. Proc Natl Acad Sci USA. 83:9591-9595 (1986)). hi this 
application, biotinylated single stranded DNA probes are reacted witib RecA to form RecA 
fflaments. The filaments are used for homology searching in circular plasmid DNA. When the 
probes are removed by binding to avidin, those plasmids containing sequences homologous to the 
probes are isolated by virtue of the triple stranded (D-loop) structures formed by the RecA 
filament and flie plasmid duplex. In order for these structures to be stable it is necessary to use 
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adenosine 5'-[y-thio]triphosphate (ATP[y-S]) in place of ATP. ATP[y-S] allows homology 
searching by RecA, by is non-hydrolyzable and flius does not allow RecA dissociation from the 
triple stranded structure. - 

■ - 

RecA has also been used to facilitate the mapping of specific DNA regions from bacterial 
and human genomic DNA (Ferrin, U, et aL, Science 25^:1494-1497 (1991); Ferrin, LJ, et aL, 
Nature Genetics d:379-383 (1994)). In these applications, RecA is used in conjunction with 
restriction enTymes (sequence specific double strand DNA endonucleases) to allow isolation or 
identification of specific DNA firagments. RecA filaments are prepared and reacted with genomic 
DNA under conditions that allow triple strand (D-loop) structure formation. The DNA is then 
treated with either a restriction endonuclease or a modification methylase (methylase action 
transfers a methyl group to the specific recognition sequence of a specific restriction 
endonuclease, thus protecting the sequence from endonuclease digestion). The presence of the 
RecA filament in tiie triple strand structure prevents digestion or methylation. 

Specific RecA filaments have also been used to protect restriction endonuclease generated 
"sticky-ends'* from being filled in by DNA polymerase such that, upon removal of the RecA 
filaments, specific fragments can be cloned into plasmid vectors (Ferrin et aL, U.S. Pat. 
5,707,811). 

Formation of RecA catalyzed double D-loops has been used to identify and isolate 
specific DNA regions from double stranded DNA (Sena et aL, U.S. Pat. 5,273,881 and 
5,670,3 16), This method requires relatively long DNA probes (>78 nucleotides) and 
complementaritv between the probes and double D-loops in order to provide for a stable stmcture. 
This contrasts fimdamentally from the present invention wherein the probe and test DNA that 
together form a D loop must have sequence differences that result in formation of mispaired or 
unpaired bases in a probe/test duplex region (to allow detection of a mutation or SNP). These 
documents note the possibility of introducing a detectable label into the probe by oligonucleotide 
extension with DNA polymerase. Importantly, this method is only suited for detection of specific 
sequences in a test DNA but is of no use in detecting mutations or SNPs - the object of the present 
invention. Moreover, the teaching of these patents is limited to use of a double D-loop. There is 
no suggestion of using of single D-loops, even for the limited purpose of sequence detection. 

All statements as to the date or representation as to the contents of these documents is 
based on the information available to the applicant and does not constitute any admission as to the 
correctness of the dates or contents of these documents. 
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SUMMARY OF THE INVE^mQN 

The present invention is directed to a method for detecting of a mutation and/or a SNP in 
a double-stranded test DNA molecule, comprising: 

(a) providing a single stranded DNA probe which is optionally detectably labeled, which 
probe has (i) a known nucleotide sequence or (ii) a sequence complementary to the 
sequence of at least a part of the test DNA. 

(b) contacting the probe with a RecA protein (or a homologue thereof defined in more detail 
below), which is optionally detectably labeled, to form a RecA filament, 

(c) contacting the RecA filament with the test DNA, thereby forming a three stranded DNA 
D-loop structure in the test DNA, which D-loop structure comprises the probe and the two 
strands of the test DNA; 

(d) contacting the DNA D-loop structure with MutS (or a homologue thereof, defined in more 
detail below), which is optionally detectably labeled, wherein the MutS binds to one or 
more base pair mismatches or unpaired bases present in the duplex portion of D-loop 
stmcture; 

(e) detecting the presence of MutS bound to the DNA D-loop structure 

wherein the presence of flie bound MutS is indicative of the presence of the mutation or the SNP 
in the test DNA. 

Also provided is a method for detecting a mutation and/or a SNP in a double-stranded test 
DNA molecule, comprising: 

(a) providing a probe comprising two complementary single stranded oligonucleotides which 
are optionaUy detectably labeled, which probe has (i) a known nucleotide sequence or (ii) 
a sequence complementary to the sequence of at least a part of flie test DNA; 

(b) contacting each of the oligonucleotides in smgle stranded fo;nn with a RecA protein, 
which is optionally detectably labeled, to form RecA filaments, 

(c) contacting the RecA filaments with the test DNA, thereby forming a four stranded DNA 
structure in the test DNA, which stmcture comprises the probe and the two strands of the 
test DNA and wherein the probe strands are annealed witii test DNA strands; 

(d) contacting said DNA stractuie with a MutS protein which is optionally detectably labeled, 

* 

wherein the MutS binds to one or more base pair mismatches or unpaired bases present in 
the four stranded DNA stmcture; 

(e) detecting the presence of the MutS bound to the DNA stmcture 
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wherein the presence of the bound MutS is indicative of the presence of the mutation or tlie SNP 
in the test DNA. 

Another method for detecting a mutation and^or a SNP in a double-stranded test DNA 
molecule, comprises: 

(a) providing a single stranded DNA probe which is optionally detectably labeled, which probe 
has (i) a known nucleotide sequence or (ii) a sequence complementary to the sequence of at 
least a part of the test DNA molecule; 

(b) contacting the probe with a RecA protein which is optionally detectably labeled, to form a 
RecA filament, 

(c) contacting the RecA filament with the test DNA, thereby forming a three stranded DNA D- 
loop stiucture in the test DNA, which stmcture comprises the probe and two strands of the 
test DNA; 

(d) contacting the DNA D-loop stmcture with immobilized MutS which binds to one or more 
base pair mismatches or unpaired bases present in the duplex portion of the D-loop stmcture; 

(e) detecting tiie presence immobilized probe DNA or RecA bound to the MutS, 
wherein the presence of the bound probe DNA or RecA is indicative of the presence of flie 
mutation or the SNP in the test DNA. 

The invention includes a method for detecting a mutation and/or a SNP in a double- 
stranded test DNA molecule, comprising: 

(a) providing a probe comprising two complementary single stranded oligonucleotides which are 
optionally detectably labeled, which probe has (i) a known nucleotide sequence or (ii) a 
sequence complementaiy to the sequence of at least a part of tiie test DNA; 

(b) contacting each of the probe oligonucleotides in single stranded form with a RecA protein, 
which is optionally detectably labeled, to form RecA filaments, 

(c) contacting the RecA filaments with the test DNA, thereby forming a four stranded DNA 
stmcture in the test DNA, which stmcture comprises the two strands of the test DNA to each 
of which is annealed a probe oligonucleotide strand; and 

(d) contacting said DNA stmcture with immobilized MutS which binds to one or more base pair 
mismatches or unpaired bases present in said four stranded DNA stmcture, 

thereby detecting the mutation and/or SNP. 

In the above method, test DNA molecule may be prokaryotic genomic DNA, eukaryotic 
genomic DNA, cDNA, viral DNA, plasmid DNA, and a DNA firagment ampUfied by PGR or by 
another amplification method. 

6 
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To aflBrmatively detect a mutation or SNP, the above probe sequence must differ from the 
sequence of the mutation or SNP by one or more nucleotide substitutions, additions or deletions 
such that the probe/test heteroduplex contains mismatched or impaired bases recognizable by 
MutS, or a MutS homologue or other mismatch-binding protein ("MBP"). If the test DNA 
sequence is identical to the probe, the test result is ''negative." It should be understood that "a 
probe" as used above may include **one or more" different probe molecules. The probe is 
preferably an oligonucleotide of about 20 to about 60 nucleotides, and is preferably selected from 
the group consisting ofi (a) a synthetic oligonucleotide; (b) a recombinant oligonucleotide; and 
(c) an oligonucleotide obtained by denaturing, and, optionally, cleaving, a double stranded DNA 
molecule. When the probe comprises two, complementary DNA molecules, they may be 
separately coated with RecA. The probe may include an adduct, which may an oligonucleotide, 
biotin or digoxigenin, or the like, to allow immobilization following D-loop formation. 

The RecA protein is preferably from E, colL 

In the methods described herein, detection is based on the use any one of the components 
detectably labeled: the probe DNA, the RecA, the MutS, (or SSB, discussed below). The label 
may be any suitable detectable label, e.g,, a fluorophore, a chromophore, a radionuclide, biotin, 
digoxigenin, etc. The protein or DNA may be labeled via a bead to which is attached the above 
fluorophore, chromophore, biotin, etc. The probe may be labeled by DNA polymerase extension 
using labeled deoxynucleotide triphosphates or nucleotide terminators. 

Preferred detection is of bound MutS which may be in solution or immobilized to a solid 
surface such as nitrocellulose, polystyrene, magnetic beads or the like. The DNA, RecA or MutS 
may be directly labeled by direct bonding or binding of the label to the protein. However, the 
term "detectably labeled," whether referring to a protein or DNA, includes "indirect" labeling 
wherein the "detectable label" is a prhnary antibody, or any other bindmg partner, of that protem 
or DNA, which is directly labeled. Alternatively, the detectable label is a combmation of an 
unlabeled primary antibody (e.g.„ anti-MutS, anti-RecA, anti-SSB) with a directly labeled 
secondary antibody specific for the primary antibody. 

In the present method, MutS (or its homologue), may be in solution or immobilized to any 
solid support. 

When the RecA or the probe above is labeled, a preferred label is a fluorophore, a 
chromophore, a radionuclide, biotin, digoxigenin, and wherein association of the probe label with 
the MutS label or the RecA label with the MutS label, is indicative of the presence of the 
mutation or the SNP in the test DNA. 



.0207728eAl I > 



wo 02/077286 PCT/US02/04875 

When the MutS or MutS homologue is labeled, the label is preferably a fluorophore, a 
chromophore, a radionuclide, biotin, digoxigenin, or a labeled bead. A preferred label for RecA 
or the RecA homologue is a fluorophore. Wlien the detectable label is a fluorophore a preferred 
detection method is flow cytometry. 
5 In the above method, the MutS or MutS homologue may be immobilized to a solid 

. support. 

In a preferred embodiment the RecA protein is labeled and the detection is of the MutS 
> label associated with the RecA label present in the DNA D loop structures. 

In another embodiment, the detectable label of RecA or its homologue is in the form of a 
10 labeled primary anti-RecA antibody, or a combination of an unlabeled primary anti-RecA 
antibody and a labeled secondary antibody specific for the primary antibody. 

MutS binding to the duplex portion of the triple strand or D-loop structure stabilizes the 
structure, allowing use of relatively short oligonucleotides. This allows separate detection of 
mutations and SNPs which may be close together. Thus, the use of MutS in the present method is 
15 an important general improvement over the prior art, e.g,, the Sena et al patents, as it serves as 
the basis for discriminating between a D loop structure that includes a mutation or SNP and one 
that does not This is because D-loop structures that do not bind MutS are those in which the 
probe and test DNA are perfectly paired, without any mismatches or unpaired bases, a state that 
favors dissociation of the probe from the test DNA. This has the additional advantage of 
20 minimizing background signals. 

La the above method, the DNA D loop structure may be forther stabilized by the addition, 
before step (d) above of the single strand DNA binding (SSB) protein (Chase et al, Nucl Acids 
Res 8:3215-3227 (1980)), or an SSB homologue, which is optionally detectably labeled. When 
= the SSB protein is labeled, the label may be a fluorophore, a chromophore, a radionuclide, biotin, 
25 : digoxigenin, or a labeled bead, and the association of the SSB label with the MutS label is 
indicative of the presence of the mutation or the SNP in the test DNA. 

Stability of the three strand structure can also be enhanced by allowing DNA synthesis to 
. extend the oligonucleotide. Such extension requires addition of a DNA polymerase and all four 
deoxynucleotide triphosphates. 
30 In the above method, flow cytometric detection may detect the coincidence of two, three 

or four labels which are bound to: (a) the MutS and the probe; (b) the MutS and the RecA; (c) the 
MutS, the RecA and the probe; (d) the MutS and the SSB; (e) the MutS, the SSB, and the probe; 

or (f) the MutS, the SSB, the probe and tiie RecA. 

* * 
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The present invention also provides a kit useful for detecting a one or more mutations or 
polymorphisms in a DNA sample, the kit being adapted to receive therein one or more containers, 
the kit comprising: - 

(a) a first container containing a RecA protein, which is optionally detectably labeled; 
5 (b) a second container containing MutS protein which is optionally detectably labeled; and 

(c) a third container or plurality of containers containing buffers and reagent or reagents capable 
of detecting the binding of MutS or the MutS homologue. 

Also included is a kit useful for detecting a specific mutation or polymorphism or a 
specific group of mutations or polymorphisms in a DNA sample, or for examining a specific 
1 0 region or regions of DNA for any mutations or polymorphisms, the kit being adapted to receive 
therein one or more containers, the kit comprising: 

(a) a first container containing RecA protein, or a homologue thereof, which is optionally 
detectably labeled; 

(b) a second container containing MutS protein which is optionally detectably labeled; 

15 (c) a third container or plurality of containers containing a specific oligonucleotide probe or 
probes, which probes are selected to be complementary to specific sequences in specific 
regions in the DNA of the sample and which form mismatch-containing or unpaired base- 
containing heteroduplexes with a mutated or polymorphic sequence or sequences in the 
specific DNA regions, which probe or probes is or are optionally detectably labeled; and 

20 (d) A fourth container or plurality of containers containing buffers and reagents capable of 

detecting the binding of MutS or its homologue to specific heteroduplexes formed between 
the probes and the sample DNA. 

BRffiF DESCRIPTION OF THE DRAWINGS 

Figures 1-7 are schematic representations of the RecA + MutS mutation/SNP detection 
25 method including various detection modalities. 

Figure 1 shows an oligonucleotide "probe" to which is added in Step (1) the RecA ( O ) 
protein, RecA coats the probe to fomi a "RecA filament" In Step (2) RecA filament is added to 
test DNA and allowed to form a triple stranded or "D-loop" structure. In Step (3), the MutS 
protein is added. If the probe is identical to the test DNA sequence, a perfectly paired duplex 
30 ("no mismatch") is formed and the MutS does not bind (left). If there are one or more sequence 
differences between the probe and test DNA sequences, a heteroduplex is formed containing one 
or more mismatches or unpaured bases ("Mismatch (SNP)") and MutS binds to that heteroduplex. 
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Figure 2 shows the JBnal complex formed after Step (3). Here both the probe and 
MutS are labeled. X = probe label. * = MutS label. 

Figure 3 shows the final complex formed after Step (3) when MutS is labeled (*) and 
RecA is labeled ( * ) 

5 Figure 4 shows the final complex formed after Step (3) when the SSB protein (O ) has 

been added. The labeled (^) SSB binds to ssDNA. Here, MutS is also labeled ★ 

Figure S shows the final complex formed after Step (3) where MutS is labeled and the 
probe has been labeled by polymerase extension using labeled deoxynucleotide triphosphates 

10 Figure 6 shows the final complex formed after Step (3) when MutS is immobilized to a 

solid surface (diagonal lines). In this embodiment, the probe is labeled (X). 

Figure 7 shows the final complex formed after Step (3) when the probe (whose sequence 
has a mismatch or mispairing with the test DNA) was in the form of two complementary 
oligonucleotides each of which was "coated" with RecA (O). MutS is labeled 

15 The structure formed is a double D-loop. 

DESCRIPTION OF THE PREFERRED EMBO DTMF.NT5; 

The present inventors have devised a novel technology for detecting mutations or SNP*s 
using a combination of at least two DNA binding proteins, RecA and MutS. In general, the 
method employs: 

20 (1) a test DNA molecule, which may be any synthetic, viral, plasmid, prokaiyotic or 
eukaryotic DNA fi-om any source and may be amplified by PGR or any other means; 
(2) a DNA probe, which may be any synthetic oligonucleotide, PGR amplicon, plasmid DNA, 
viral DNA, bacterial DNA or any other DNA of known sequence or of sequence complementary 
to the test DNA or to a portion thereof, 

25 (3) E. coli RecA or a homologue thereof, as defined below, and 

(4) E, coli MutS or a homologue thereof, as defined below, or another mismatch-binding 
protein from any prokaryotic or eukaryotic species. 

As used herein and in the present claims (for the sake of brevity and clarity), the terai 
"MutS," "RecA" or "SSB" is intended to include either the native or mutant K coli MutS, RecA 

30 or SSB protein, or a "homologue" thereof as defined below. A "homologue" of MutS, RecA, 

SSB, etc., is a protein that has functional and, preferably, also structural/sequence similarity to its 
"reference" protein. One type of homologue is encoded by a homologous gene from another 

10 
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Species of tlie same genus or even from other genera. As described below, the above proteins, 
originally discovered in bacteria, have eukaiyotic homologues in groups ranging from yeast to 
mammals. A functional homologue must possess the biochemical and biological Sictivity of its 
reference protein, particularly tlie DNA binding selectivity or specificity so that it has the utility 
5 described herein.. In view of this fimctional characterization, use of homologues of ^. coli RecA, 
MutS or SSB proteins, including proteins not yet discovered, fall within the scope of the invention 
if these proteins have the described DNA binding activity or "improved" binding activity. 
Nonlimiting examples of improvements include recognition of the C:C mismatch by a MutS 
homologue, a RecA homologue ftat binds to shorter DNA molecules, or higher afBnity binding of 
1 0 single stranded DNA by a SSB homologue. 

"Homologue" is also intended to include those proteins altered by mutagenesis or 
recombination which has been performed to improve the protein's desired function for use in this 
invention. These approaches are generally described and well-referenced below. Clearly, it is 
within the skill of the art to develop such genetically engineered homologues without resorting to 
15 undue expramentation. Thus, for example, one would apply these approaches, starting with, for 
example, DNA encodmg a "native" MutS protein (mutagenesis) or two or more DNA molecules 
each encoding different "native" MutS proteins (recombination), express the gene product, and, 
using known screening techniques (including the methods of fliis invention), measure the 
appropriate DNA binding activity. Hence, even in the absence of specific examples of genetically 
20 engineered, improved MutS, RecA or SSB homologues, those skilled in the art are enabled to 
produce and identify such homologues using only routine experimentation. 

Mutagenesis of a protein gene, conventional in the art, is generally accomplished in vivo 
by cloning the gene into bacterial vectors and duplicating it in cells under mutagenic conditions, 
e.g.^ in the presence of mutagenic nucleotide analogs and/or under conditions in which mismatch 
25 repair is deficient. Mutagenesis in vitro^ also well-known in the art, generally employs error- 
prone PGR wherein the desired gene is amplified imder conditions (nucleotide analogues, biased 
triphosphate pools, e/c.) that favor misincoiporation by fee PGR polymerase. PGR products are 
then cloned into expression vectors and the resulting protems examined for function in bacterial 
cells. 

^® Recombination generally involves mixing homologous genes from different species, 

allowing them to recombine, frequently under mutagenic conditions, and selecting or screening 
for improved function of the proteins from the recombined genes. This recombination may be 
accomplished in vivo, most commonly in bacterial cells under mismatch repair-deficient 

ft 
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conditions which allow recombination between diverged sequences and also increase the 
generation of mutations. One of the present inventors has developed such methods of protein 
"evolution" (Radman et al. US patents 5,912,119 and 5,965,415). In addition,"Stemmer and 
colleagues have devised methods for both in vivo and in vitro recombination of diverged 
5 sequences to create "improved" proteins. Most involve PGR "shuffling'' wherein two diverged 
sequences are digested and mixed together such that the fragments serve as both primer and 
template for PGR and, in so doing, combine different segments of the diverged genes, which is, in 
effect, genetic "recombination," Frequently, error prone PGR conditions are included to furHier 
stimulate generation of novel sequences. Resulting PGR products are cloned into expression 

10 vectors, and the resulting proteins are screened for improved function. See, for example, U.S. 

Patents 5,512,463; 5,605,793; 5,81,238; 5,830,721; 5,837,458; 6,096,548; 6,117,679; 6,132,970; 
6,165,793; 6,180,406; 6,251,674; 6,277,638; 6,287,861; 6,287,862; 6,291,242; 6,297,053; 
6,303,344; 6,309,883; 6,319,713; 6,319,714; 6,323,030; 6,326,204; 6,335,160; 6,344,356. 

As noted, homologues of the present invention generally share sequence similarity with their 

15 reference protein. To determine the % identity of two amino acid sequences (or of two nucleic acid 
sequences), the sequences are aligned for optimal comparison pmposes (e.g., gaps can be introduced 
in one or both of a first and a second amino acid or nucleic acid sequence for optimal alignment and 
non-homologous sequences can be disregarded for comparison purposes) ! In a preferred method of 
alignment, Cys residues are aligned. The length of a sequence being compared is at least 30%, 

20 preferably at least 40%, more preferably at least 50%, even more preferably at least 60%, and even 
more preferably at least 70%, 80%, or 90% of the length of the reference sequence {e.g., E, colt 
MutS or E. coll RecA). The amino acid residues (or nucleotides) at corresponding amino acid (or 
nucleotide) positions are then compared. When a position in the jSrst sequence is occupied by the 
same amino acid residue (or nucleotide) as the corresponding position in the second sequence, then 

25 the molecules are identical at that position. As used herein ammo acid or nucleic acid **identily'' is 
also to be considered amino acid or nucleic acid "homology". The % identity between the two 
sequences is a fimction of the number of identical positions shared by the sequences, taking into 
account the number of gaps and the length of each gap which need to be introduced for optimal 
alignment. The comparison of sequences and determination of % identity between two sequences 

30 can be accomplished using mathematical algorithms, e.g., the Needleman and Wunsch (J. Mol Biol 
-/S:444-453 (1970) algorithm which has been incorporated into the GAP program (see below) using 
either a Blossom 62 matrix or a PAM250 matrix. A preferred program, "GAP" in the GGG software 
package, available at httD://www.gcg.com. uses a NWSgapdna.GMP matrix and a gap weight of 40, 
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50, 60, 70, or 80 and a length weight of 1, 2, 3, 4, 5, or 6. In another approach, the % identity 
between two amino acid or nucleotide sequences is determined using the algorithm of E. Meyers and 
W. Miller (CABIOS, 4: 11-17 (1989)) which has been incorporated into the ALIGN program 
(version 2,0), using a PAM120 weight residue table, a gap length penalty of 12 and a gap penalty of 
5 4. 

The nucleic acid or protein sequence of a particular MutS, RecA or SSB protein can 
further be used as a "query sequence" to perform a search against public databases, for example, 
to identify other family members or related sequences. Such searches can be performed using the 
NBLAST and XBLAST programs (version 2.0) of Altschul et al (1990) J, Mol. Biol 215:403- . 
10 4 1 0. To obtain gapped alignments for comparison purposes. Gapped BLAST can be utilized as 
described in Altschul et al. (1997) NucL Acids Res. 25:3389-3402. When using BLAST and 
Gapped BLAST, the de&ult parameters of the respective programs can be used. See 
http://www.ncbi.nlm,nih.gov. For example, BLAST nucleotide searches can be performed with 
the NBLAST program, score=100, wordlengti[i==12 to obtain nucleotide sequences homologous to 
15 a query MutS, RecA or SSB coding nucleic acid sequence. BLAST protein searches can be 
performed with the XBLAST program, preferably set at score = 50, wordlength = 3, to obtain 
amino acid sequences homologous to a query MutS, RecA or DNA protein molecule (e.g., wild- 
type sequence from £. coli). 

Thus, a preferred homologue of an JS. coli MutS protein, an E. coli RecA protein or an E. 
20 coli SSB protein has, first and foremost, the functional activity of native E. coli MutS (or RecA or 
SSB), or even improved activity over the native protein as noted above. A preferred homologue 
also shares sequence similarity with the native E. coli protein, when determined as above, of at 
least about 20% (at tlie amino acid level), preferably at least about 40%, more preferably at least 
about 60%, even more preferably at least about 70%, even more preferably at least about 80%, 
25 and even more preferably at least about 90%. . 

At least 65 RecA genes from different bacteria have been cloned and sequenced (Sandler, 
SJ, et al, Nucl Acids Res 2^:2125-2132 (1996); Roca, Al, et al., Crit Rev Biochem Mol Biol 
25:415-456 (1990); Eisen, JA,/. Mol Evol ^i; 1105-1 123 (1995); Lloyd, AT, etal, J. Mol. Evol. 
57:399-407 (1993)). RecA homologues, known as RadA, have been identified in three archaean 
species (Sandler a/., 5wpra;; Seitz, EM, etal. Genes Dev. 72:1248-1253 (1998)). Eukaryotic 
homologues of RecA have been identified in every eukaryotic species examined; the prototype 
eukaryotic RecA homologue is flie yeast Rad51 protein (Seitz et al, supra\ Bianco, PR, et al. 
Frontiers Biosci. 5:570-603 (1998)). Therefore, any homologue ofE. coli RecA which, like the 
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£. coli protein, forms DNA filaments for initiation of genetic recombination as well as any 
fiinctional form that has been mutated or evolved in vivo or in vitro is included within the scope 
of the present invention. - 

Examples ofE. coli MutS homologues are the MutS protein of Salmonella iyphimuriwn 
5 (Lu et al, supra; Haber, LT, et al, J. Bacteriol 170: 197-202 (1988; Pang, PP, et ah, J, Bacteriol 
i(J3:1007-1015 (1985); and the HexA protein of Streptococcus pneumoniae (Priebe SD, et al, J. 
Bacteriol 170:190-196 (1988); Haber et aL, supra). In addition, eukaryotic homologues of MutS 
or HexA can also be used, such as those encoded by the homologous sequences identified in 
yeast, human, mouse, fix)g or hamster DNA (Shimada, T, et al, J. Biol Chem. 264:20171 (1989); 

10 Linton, J, et al, Mol Cell Biol 7:3058-3072 (1989); Fujii, H. et al, J, Biol Chem. 264:10051 
(1989)), The homology between MutS homologues in prokaryotic and eukaryotic species is 
illustrated in Reenan, RA et al^ Genetics 132:963-973 (1992), where the E. coli MutS nucleotide 
sequence is shown to be highly homologous in one region to S. typhimurium MutS, S. 
pneumoniae hexA, mouse Rep-1, and human DUC-1. PGR primers which successfully led to the 

15 cloning of Saccharomyces cerevisiae (yeast) homologues of MutS, named MSHl and MSH2, 

were based on this homology. Reenan et al, supra, showed the amino acid sequence homology 
between yeast MSHl, MSH2 mdE. coli, S. typhimurium and S. pneumoniae MutS homologues. 
New, L. et al, Mol Gen. Genet 23P:97-108 (1993) disclosed another yeast gene, MSH3, which is 
a homologue of eukaryotic MutS and indicates the most conserved sequences among MutS, HexA 

20 and mouse REP-3. A search for a new yeast gene based on this sequence homology led to 

discovery of yeast MSH3. Fishel, R. et al. Cell 75: 1027-1029 (1993) described Ihe cloning of a 
another human MutS homologue (hMSH2) using for PGR the homologous sequences from oflier 
MutS homologues as described by Reenan et aly supra. 

Therefore, any homologue ofE. coli MutS which, like the E. coli protein, recognizes 

25 DNA mismatches (single base mismatches or several unpaired bases) as well as any functional 
form version that has been mutated or evolved in vivo or in vitro is included within the scope of 
the present invention. 

Both MutS and RecA function in vitro. MutS is the basis of the Gene Check Immobilized 
Mismatch Binding Protein (IMBP) mutation detection technology which is currently being used 
3 0 commercially to genotype sheep for scrapie susceptibility (Wagner et al , supra; Debbie et al , 
supra; U.S. Pat 6,027,877, 6,1 14,1 15, 6,120,992, 6,329,147, all of which references are 
incorporated by reference in their entirety). 
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RecA forms a three stranded structure in vitro along sequence stretches as short as 15 
nucleotides (Ferrin et aL, 1991, supra). Combining the activities of RecA and MutS, creates a 
most powerful mutation/single nucleotide polymorphism (SNP) detection system in which RecA- 
coated ssDNA catalyzes formation of a three strand (or "D-loop") structure without the need for 
5 prior denaturation of the test dsDNA. The D-loop will contain mismatches or unpaired bases 
when the sequences of the probe DNA and the test DNA are not identical. These mispaired or 
unpaired bases can be recognized by MutS (or any MutS homologue from any species or any 
other mismatch binding protein) MutS binding will stabilize the D-loop structure. Thus, the 
combination of MutS with RecA mediated D-loop formation allows formation of very small, 
10 stable D-loops (when such D-loops contain mispaired or unpaired bases), which in turn allows 
separate examination of short DNA intervals for mutations and SNPs as well as small scale 
scanning for known (and unknown) mutations and SNPs. Such examination is neither suggested 
nor supported by U.S. Pat. 5,273„881 and 5,670,316 cited above. 

The combined "RecA/MutS" method can be used with a variety of platforms some which 
1 5 allow mutation/SNP detection without the need for PGR amplification of the test DNA. 

RecA/MutS System wi th Flow Cytometric Detection of MutS-RecA Co-localization 

In one preferred embodiment, the present system employs: (1) labeled RecA and MutS; 
(2) specific probe oligonucleotides that are detectably labeled for detection by flow cytometry; 
and (3) flow cytometric detection of the labels. 

20 Probe specificity derives firom the probe's sequence: A probe is designed to be 

complementary to the "normal" or wild type, non-polymorphic sequence of, the site or region of 
interest as well as the flankmg region. When a mutation or polymorphism is present tiie 
probe/test heteroduplex will contain one or a few mispaired or unpaired bases. In the absence of 
a mutation or polymorphism, probe/test heteroduplex will be perfectly paired. 

25 Formation or stabilization of the D-loop formed by the RecA filaments and test DNA may 

be further enhanced by the addition of single strand binding (SSB) protem &om E. coli or a 
homologue of this protein firom another species or by allowing DNA polymerase catalyzed 
extension of the probe DNA using the test DNA as template. 

In this method detection of mutations and SNPs is accomplished by detecting the co- 

30 localization of either (a) RecA and MutS, (b) probe DNA and MutS or (c)RecA, MutS and probe 
DNA. Alternatively, labeled SSB, or a homologue can be used to label D-loop stmctures by 
binding to the single stranded portion. In this instance, co-localization of SSB signal witli MutS 
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signal is indicative of the presence of one or more mismatches or unpaired bases in the duplex 

portion of the D-loop. 

» 

The DNA probe may be of any length but is preferably an oligonucleotide, more 
preferably a synthetic oligonucleotide, of about 30-60 bases in length. 
5 The probe is specific for a particular mutation or polymorphism, or specific for a 

particular genetic region that is being examined for the presence of a known or unknown mutation 
orSNP. 

The test DNA may be of any length (up to an entire chromosome) and can be eitlier 
genomic or plasmid DNA or a PGR amplicon. 

10 Labeling of MutS, RecA and SSB proteins can be accomplished via a variety of well 

established methods: The proteins can be directly labeled wifli fluorophores or fluorescent labels^ 
including, but not limited to, fluorescein (and derivatives), 6-Fam, Hex, tetramefhylrhodamine, 
cyanine-5, CY-3, allophycocyanin, Lucifer yellow CF, Texas Red, rhodamine, Tamra, Rox and 
Dabcyl. Indirect labeling utilizes, for example, "primary** antibodies (monoclonal or polyclonal) 

1 5 specific for MutS or RecA which can be labeled (e.g., with fluorophores). Altematively, 

secondary antibodies, e,^., anti-immunoglobuliasuch as anti-isotype antibodies, specific for the 
primary anti-MutS and anti-RecA antibodies can be labeled (e.^., with fluorophores). 

In another embodiment, the proteins (MutS or RecA, primary antibodies or secondary 
antibodies) are biotinylated. The biotin is then bound by fluorescent avidin (or streptavidin). 

20 Altematively, streptavidin (which is multivalent) may first be bound to the biotin of the 

biotinylated protein, and then bound to other fluorescently-labeled biotinylated compounds. The 
proteins can be attached to fluorescent microbeads, or microbeads to which is attached a dififerent 
detectable label. Attachment of proteins to fluorescent microbeads may be by any methods well 
known in the art, including, but not limited to, direct adsorption to polystyrene or other beads, 

25 covalent linkage via carboxyl, amino, tosyl or other groups, binding via biotin/avidin or 
streptavidin interaction (requires biotinylation of the protein) and binding to immobilized 
antibody. The labeling of MutS by attaching it to a labeled bead is functionally eqiiivalent to 
immobilization (Wagner et aL, supra; Debbie et al , supra) and will, therefore, enhance MutS (or 
other mismatch binding protein) function similarly to the effect observed when MutS is 

30 immobilized to nitrocellulose or polystyrene. 

It is also possible to label MutS and the DNA probe (instead of, or in addition to, labeling 
of RecA). The probe may be labeled directiy with a fluorophore or with a compound such as 
biotin or digoxigenin and detecting the adducts by conventional methods. The probe is detected 
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by measurement of fluorescence, color luminescence or any other method suitable for the label 
that has been selected. 

Signal amplification can be introduced, for example, by using (a) labeled secondary 
antibodies or (b) avidin- or streptavidin- coated microbeads to bind biotin labeled probe. This 
5 will result, for example, in multiple biotinylated probes being bound, which will, in turn, be 

bound by MutS molecules (if the probe/test DNA complex contains one or more mismatches or 
unpaired bases) and thereby, greatly increasing the signal. 

The procedure is earned out by mixing the probe with RecA, MutS and the test DNA 
under conditions appropriate for formation of RecA filaments, D-loops or triple strand structures. 
10 . RecA filament formation can be accompKshed, for example, in a Tris-HCl or Tris-acetate buffer, 
(20-40 mM, pH 7.4-7.9) with MgClz or Mg acetate (1-4 mM), ditMothreitol (0.2-0.5 mM), and 
ATP or ATP[y-S] (0.3-1.5 mM). If ATP is used, an ATP regenerating system consisting of 
phosphocreatine and creatine kinase niay be included. RecA and probe are generaUy added at a 
molar ratio of 0.1-3 (RecA to nucleotides). Iffke probe is double stranded, it must first be 

15 denatured before RecA coating. Incubation is at room temperature or, preferably, 37° C, for 5-30 
min. D-loop or triple strand structure formation involves adding RecA filaments to double 
stranded DNA and incubating, preferably at 37°C, for about 15 min to about 2 hrs. It is also 
possible to form RecA filaments and do homology searching in a single reaction vessel, i.e., to 
mix RecA with oHgonucleptides and double stranded DNA at the same time. See, for example, 

20 Rigas et al. supra; Honigbeig, SM, et al, Proc Natl Acad Sci USA 83:9586-9590 (1986); any of 
the Ferrin e/ al. publications ( supra). 

Mismatch binding by MutS is accomplished by adding the MutS to the double stranded 
test DNA at or before the time RecA or RecA filaments are added. MutS may be in solution or 
immobilized. Generally, O.lng - 0.5ng of MutS is added. See, for example, Lishanski et al., 

25 supra; Wagner et al., supra; Debbie et al, supra; U.S. Pat 6,027,877, 6,1 14,115, 6,120,992, 
6,329, 147; Gififord, U.S. Pat. 5,750,335; Jiricny et al., supra. 

Following incubation to allow homology search by flie RecA filament and binding of 
mismatched or unpaired bases by MutS, the mixture is preferably analyzed by flow cytometry in 
the case of fluorescent labeling. The flow cytometer is set to detect as a signal the simultaneous 

30 presence of both labels (that on the MutS and that on the RecA and/or probe) or the presence of a 
•third" color created by the juxtaposition of the two (or three) labels. The presence of such signals 
is an indication of the presence in the san^le of sequences differing from the probe by one or a 
few single mismatches or unpaired bases. 

17 



BNSDOCIO: <WO 020772BeA1 I > 



wo 02/077286 PCT/US02/04875 

The power of the RecATMutS method described herein is that the background signals are 
very low, and RecA + MutS (or MutS + DNA probe or MutS + RecA -4- DNA probe) will be 
found together only under conditions in which RecA-coated oligonucleotide probe has bound to 
test DNA in a way that creates a heteroduplex with a mismatched or unpaired base. 
5 Although MutS would be expected to bind to other mismatches found in the test DNA, 

these should be rare. This is so because in this method the test DNA is never denatured and 
annealed, a process which can cripple current MutS-based mutation detection technologies 
because of the random mismatches created by homologous (or nearly homologous) DNA 
annealing and by incomplete annealing which results in regions of ssDNA that can form 
10 mismatch-containing secondary structure. RecA has the capacity to coat any ssDNA in the 
sample, but, for the reasons described above, stretches of ssDNA will be very rare. 

The sequence of interest will, in general, be present only once per chromosome. It is 
therefore a simple matter to set the flow conditions to detect each occurrence of RecA-MutS or 
probe DNA-MutS juxtaposition. To overcome random MutS binding (if it occurs), DNA 
15 fragment size can be reduced, by shearing or nuclease digestion. The effect of this reduction is to 
minimize the likelihood of random MutS binding in the same fragment in which the probe and 
RecA bind. 

fe another generally applicable embodiment of this invention, two complementary DNA 
probes can be employed instead of one. These probes are preferably precoated separately with 

20 RecA before being added to the test DNA in order to minimize their self-annealing. The two 
complementary, ssDNA probes will bind to both strands of test DNA in the region of interest, 
thereby: (1) helping stabilize the D-loop structure by forming a probe-length double duplex, (2) 
assuring detection of all mutant/wild type pairings, including those arising from G to C or C to G 
transversions, and (3) increasing signal, particularly from poorly recognized mismatches. The 

25 assurance of recognition in (2), above, is particularly important because C:C mismatches are not 
detected in vivo or in any mismatch detection methods developed to date, whereas the reciprocal 
G:G mismatches are always well detected. The present inventors and their colleagues have 
discovered that increasing the nimiber of mismatches in a single heteroduplex fragment increases 
binding by MutS and each duplex region in a double duplex structure will contain a mismatch. 

30 In another generally applicable embodiment of this invention, MutS may be inmiobilized, 

and either the probe of the RecA may be detectably labeled. In this embodiment, binding of the 
probe or RecA to immobilized MutS is indicative of one or more mismatches or unpaired bases in 
the D-loop structure formed between the probe and test DNA. 
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In another generally applicable embodiment, test dsDNA is immobilized to a solid 
support, to allow detection of mutations and SNPs by labeling, as described above. Here, the 
prefeired test DNA is PCR-amplified UNA. Immobilization of amplified DNA can be 
accomplished by using a 5' label, such as biotin, or a carboxy or amino group on one of the 
primers. Amplified DNA with a 5' biotin label can be immobilized to a solid support via avidin 
or streptavidin binding. Probe DNA is coated with RecA and mixed with the immobilized DNA 
or mixed with ttie DNA prior to immobilization under conditions tiiat allow triple strand or D- 
loop structure formation. Labeled MutS, or a homologue, is added either before, after or during 
triple strand or D-loop structure formation. Binding of MutS (or its homologue) to the 
immobilized DNA indicates flie presence of one or more mismatches or unpaired bases in the 
triple strand or D-Ioop structure. This embodiment is ideally suited for use in microarray (DNA 
chip) applications. 

In another generally applicable embodiment, oligonucleotide probes are prepared with a 
5' adduct to allow immobilization of the probe/test complex as in Rigas a/., sig)ra. The adduct 
may be a biotin moiety, a specific oligonucleotide or any other adduct that would allow specific 
retrieval of the oligonucleotide. The probe is mixed with RecA to form RecA filament and then 
mixed with test DNA to form specific D-loop structures. The probes may contain an additional 
detectable label or may be labeled after D-loop formation by allowing the amiealed 
oligonucleotides to be extended by DNA polymerase using labeled nucleotide triphosphates or 
nucleotide triphosphate analogues. In this case, only those oligonucleotides which form D-loop 
structures will be labeled, which wiU reduce, even fiirther, any background signal. Further 
specificity can be obtained by using nucleotide terminators, such as dideoxy- or acyclo-nucleotide 
triphosphates in a mix of aU four terminators wherein label is associated only with the terminator 
which is complementary to the first base in the test DNA beyond flie 3' end of the 
oligonucleotide. 

The 5' label of the oligonucleotide can be used to immobilize the D-loop structure to any 
solid support. Association of the MutS label with the 3' label of the oligonucleotide indicates the 
presence of one or more mismatches or unpaired bases in the D-loop region and will be 
diagnostic of the presence or absence of a specific sequence in the test DNA. If the solid support 
is a microtiter plate, the ratio of MutS signal to oligonucleotide signal will be characteristic of flie 
genotype of the test DNA. For example, a very high ratio (extensive MutS binding) indicates 
homozygosity for the test genotype with sequence different flian the probe, wherein 
heteroduplexes in the D-loop contain mismatches or unpaired bases. A low ratio (little of no 

t 
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MutS binding) indicates homozygosity for the test genotype with sequence identical to the probe, 
wherein heteroduplexes in the D-loop are perfectly paired. An intermediate ratio (approximately 
half of the high ratio), indicates a heterozygous genotype, wherein approximately half the 
heteroduplexes in D-loops are perfectly paired and half contain mismatches or unpaired bases. 

In applying the present method to clinical diagnosis, it is possible to perform multiple 
assays with a small blood sample (because 1 yil of blood contains about 10"* copies of each 
sequence). For the sequence being detected, the nimiber of positive signals in a sample is an 
indication of the genotype, as low or no signal indicates that the sample is homozygous for a 
sequence perfectly complementary to the probe. A very high signal indicates homozygosity for a 
sequence differing from the probe by one or a few single nucleotide substitutions or one or a few 
unpaired bases (as determined by the recognition properties of MutS or its homologue). An 
intermediate signal indicates heterozygosity for the sequence of interest. 

For routine mutation detection, standards {ue,, known genotypes) are run with each batch 
of test samples to provide a standard curve for genotype determination. Of course, standard ciffve 
formation and genotype determination depend on accurate test DNA quantitation. Thus, when 
RecA-catalyzed D-loop formation, MutS binding and flow cytometric signal detection are all 
efiScient, so that, e.g.^ 5,000 - 20,000 sequences are sufficient for a genotype determination, as 
many as 1000 or more separate assays can be performed on a single ml of blood. 

This technology is ideally suited to multiplexing wherein several sites in a single sample 
of genomic, plasmid or amplified DNA are interrogated simultaneously. Li this application, 
specific probes are designed with adducts, most preferably 5' oligonucleotide adducts that allow 
individual probes + test DNA (D-loop or triple strand stmctures) to be separately isolated firom a 
mixture of many probes and test DNA. When 5' oligonucleotides are used, isolation involves 
annealing the specific oligonucleotide adducts to immobilized oligonucleotides of complementary 
sequence. Probe label for detection can be added to the 3' opposite end of the probe from the 
isolation moiety or can be added by polymerase during the reaction (see below). 

It is also possible to run multiple sequential tests on a single sample by deproteinizing the 
sample after each flow cytometry run. Removal of RecA and MutS will cause any three stranded 
stmctures to Ml apart. The oligonucleotide probe can then be removed by passing the sample 
through a mini DNA binding column (which does not retain short oligonucleotides). The test 
DNA can then be recycled through the entire process with a new probe. This entire process can 
easily be automated using methods and tecl^lologies well-known in the art. 
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Example of RecA/MutS System Combined with Immobilized MutS 

Because RecA facilitates heteroduplex formation without requiring denaturation of test 
DNA, the present RecA/MutS system can be used as an enhancement for virtually any 
mutation/SNP detection method which depends on formation of heteroduplex DNA. This 
5 includes: (1) gel shift assays, (2) filter binding assays, (3) mismatch cleavage assays and (4) 

Immobilized Mismatch Binding Protein (IMBP) assays. IMBP assays, developed by one of the 
present inventors, utilizes components of the present RecA/MutS system, including labeled probe 
DNA and MutS (immobilized). 

However, the most successfiil IMBP assay formats require probes of length equal to the 

10 test DNA (generally a PGR ampHcon). Combining the RecA/MutS system with the IMBP assay 
accomplishes the following : (1) eliminates the need to denature test DNA, (2) allows the use of 
longer PGR products than were usable with previous IMBP assay formats, (3) allows the 
replacement of long probes (which are generally produced by PGR amplification of cloned 
sequences) with shorter synthetic oligonucleotide probes, and (4) allows a single PGR product to 

15 be examined at several different sites along its sequence by using a combination of short 
oligonucleotide probes. 

One major problem afflicting mutation detection assays that require PGR amplification of 
test DNA is the introduction of errors during amplification. PGR is well known to be a low 
fidelity process when compared to in vivo DNA replication Errors introduced in each 

20 amplification cycle are propagated throughout the entire process and, because the errors are, in 

general, introduced at random positions, they will all form mismatches when the PGR amplicon is 
denatured and annealed. Because the number of "errors per fragment" is of primary concern and 
because the likelihood of an error in a given fiagment clearly depends on fiiagment length, the 
fi^equency of PGR errors limits the length of PGR firagments that can be used in mutation 

25 detection assays. 

The RecA/MutS system of the present invention minimizes or eliminates these concerns 
about PGR errors by completely eliminating the need to amplify or denature the test DNA. 
Therefore, even when amplification is employed, the only errors that will affect the assay are 
those created in the final PGR cycle (because earlier errors will be copied correctly in the final 
30 PGR cycle and the molecules will not, thereafter, be denatured. Moreover, probe annealing to the 
test DNA will not introduce a significant number of mismatches or unpaired bases since the 
probes of the present invention are preferably very short (i.e., about 20-60 nucleotides). 
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Conditions described above are employed when applying the RecA/MutS method 
to IMBP assays. The probe can be of any length and from any source, but is preferably a 
synthetic oligonucleotide or a pair of complementary oligonucleotides (see above) of 
about 20-60 nucleotides in length. Again, the probe is specific for (1) a particular 
mutation or polymorphism or (2) specific for a particular region being examined for the 
presence of known or unknown single nucleotide polymorphism. 

When a pair of probes is used (see above), they are preferably mixed individually with 
RecA. Formation of the RecA filaments is permitted to take place either in the presence of test 
DNA or before mixing with test DNA. 

Test DNA may be from any source, preferably genomic DNA or PGR amplified DNA. 

D-loops (or double duplexes when a pair of complementary probes is used) may be 
formed before mixing with (or in the presence of) immobilized MutS (or other MBP). 
Immobilization may be to any solid support or carrier. By "solid support** or "carrier** is intended 
any support capable of binding a protein while permitting washing without dissociating from the 
protein. Well-known supports or carriers include, but are not limited to, natural cellulose, 
modified cellulose such as nitrocellulose, polj^tyrene, polypropylene, polyethylene, 
poiyvinylidene difluoride, dcTCtran, nylon, polyacrylamide, and agarose or Sepharose®. Also 
usefiil are magnetic beads. The support material may have virtually any possible structural 
configuration so long as the immobilized MBP is capable of binding to the target nucleic acid 
molecule. Thus, the support configuration can include microparticles, beads, porous and 
impermeable strips and membranes, the interior surface of a reaction vessel such as test tubes and 
microtiter plates, and the like. Those skilled in the art will know many other suitable carriers for 

« 

binding the MBP or will be able to ascertain these by routine experimentation. 

As above, SSB protein is optionally used to facilitate D-loop formation and increase D- 
loop stability. As noted above, MutS stabilizes D-loop structures that include a mismatch or 
unpaired bases. 

Detection of MutS-bound DNA is normally accomplished by using labeled probes. 
Probes may be labeled with any fluorophore, chromophore, radionuclide or luminescer prepared 
by any labeling method, including those described above. Probe labeling may also be 
accomplished by polymerase mediated extension of the oligonucleotide in the D-loop structure 
using labeled nucleotides or nucleotide analogues. Labeling of RecA, particularly via a labeled 
anti-RecA antibody, would amplify the signal so that genomic DNA can be tested without prior 
amplification. 
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It is difficult to overstate the power of the RecA/MutS method. It is rapid, works with 
small samples and can readily be adapted to clinical applications for diagnostic genotyping and 
mutation/SNP detection. Perhaps the most important distinguishing advantage of the present 
invention is its complete independence from DNA amplification (z.e., PGR). 
5 KITS 

The present invention is also directed to kit or reagent systems useful for practicing the 
methods described herein. Such kits will contain a reagent combination comprising the essential 
elements required to conduct an assay according to the methods disclosed herein. The reagent 
system is presented in a commercially packaged form, as a composition or admixture where the 

1 0 compatibility of the reagents will allow, in a test device configuration, or more typically as a test 
kit, i.e., a packaged combination of one or more containers, devices, or the like holding the 
necessary reagents, and usually including written instmctions for the performance of assays. The 
kit of the present invention may include any configurations and compositions for peifomiing the 
various assay formats described herein. 

15 Kits containing RecA, MutS and, where applicable, antibodies and/or SSB, are within the 

scope of this invention. In one embodiment, a kit of this invention designed to allow detection of 
specific mutations and/or polymorphisms or mutations and/or polymorphisms in specific regions 
of target DNA, includes oligonucleotides or other probes specific for (a) selected mutations 
and/or (b) SNPs, or (c) specific region or regions of target DNA (to allow scaiming of regions for 

20 any mutations or polymorphisms, known or unknown). The probes may be labeled as described 
above. The kits also include labeled MutS, or antibodies allowing detection of MutS, which may 
be immobilized to a solid support or carrier or provided in immobilizable form with separate 
carrier; RecA, which may be labeled; and a plurality of containers of appropriate buffers and 
reagents. 

25 Another kit, is designed to allow ends users to design their own probes for detection of 

mutations and/or polymorphisms or to scan a DNA region of their choice; such a kit contains all 

of the above described reagents except probe DNA. 

. ■ « . 

The references cited above are all incorporated by reference herein, whether specifically 
incorporated or not 

30 Having now fiilly described this invention, it will be appreciated by those skilled in the art 

that the same can be performed within a wide range of equivalent parameters, concentrations, and 
conditions without departing &om the spirit and scope of the invention and without undue 
e^erimentation. 
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WHAT IS CLAIMED IS; 

1 . A method for detecting a mutation and/or a SNP in a double-stranded test DNA 
molecule, comprising: 

(a) providing a single stranded DNA probe which is optionally detectably labeled, 
5 which probe has (i) a known nucleotide sequence or (ii) a sequence 

complementary to the sequence of at least a part of the test DNA; 

(b) contacting the probe with a RecA protein which is optionally detectably labeled, 
to form a RecA filament; 

(c) contacting the RecA filament with the test DNA, thereby forming a three stranded 
10 DNA D-loop stmcture in the test DNA, which D-loop structure comprises the 

probe and the two strands of the test DNA; 

(d) contacting the DNA D-loop stmcture witii a MutS protein which is optionally 
detectably labeled, wherein the MutS binds to one or more base pair mismatches 
or unpaired bases present in the duplex portion of D-loop stmcture; 

1 5 (e) detecting the presence of MutS bound to the DNA D loop stmcture, 

wherein the presence of the bound MutS is indicative of the presence of the mutation or the SNP 
in the test DNA. 

2. A method for detecting a mutation and/or a SNP in a double-stranded test DNA 
molecule, comprising: 

20 (a) providing a probe comprising t\yo complementary single stmnded 

oligonucleotides which are optionally detectably labeled, which probe has a 
known nucleotide sequence or a sequence complementary to the sequence of at 
least a part of the test DNA; 

(b) contacting each of the oligonucleotides in single stranded form with a RecA 
25 protein, which is optionally detectably labeled, to form RecA filaments, 

(c) contacting the RecA filaments with the test DNA, tiiereby forming a four stranded 
DNA stmcture in the test DNA, which stmcture comprises the probe and the two 
strands of the test DNA and wherein the probe strandis are annealed with test DNA 
strands; 

30 (d) contacting said DNA stmcture with a MutS protein which is optionally detectably 

labeled, wherein the MutS binds to one or more base pair mismatches or unpaired 
bases present in the four stranded DNA stmcture; 

(e) detecting the presence of the MutS bound to the DNA stmcture 
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wherein the presence of the bound MutS is indicative of the presence of the mutation or the SNP 
in the test DNA. 

3. A method for detecting a mutation and/or a SNP in a double-stranded test DNA 
molecule, comprising: 

(a) providing a single stranded DNA probe which is optionally detectably labeled, 
which probe has (i) a known nucleotide sequence or (ii) a sequence 
complementary to the sequence of at least a part of the test DNA molecule; 

(b) contacting the probe with a RecA protein which is optionally detectably labeled, 
to form a RecA filament, 

(c) contacting the RecA filament with the test DNA, thereby forming a tiiree stranded 
DNA D~loop structure in the test DNA, which stmcture comprises the probe and 
two strands of the test DNA; 

(d) contacting the DNA D-loop structure with immobilized MutS which binds to one 
or more base pair mismatches or unpaired bases present in the duplex portion of 
the D-loop structure; 

(e) detecting the presence immobilized probe DNA or RecA bound to the MutS, 
wherein the presence of the bound probe DNA or RecA is indicative of the presence of the 
mutation or the SNP in the test DNA. 

4. A method for detecting a mutation and/or a SNP in a double-stranded test DNA 
molecule, comprising; 

(a) providing a probe comprising two complementary single stranded 
oligonucleotides which are optionally detectably labeled, which probe has a 
known nucleotide sequence or a sequence complementary to the sequence of at 
least a part of the test DNA; 

(b) contacting each of flie probe oligonucleotides in single stranded form with a RecA 
protein, which is optionally detectably labeled, to form RecA filaments, 

(c) contacting the RecA filaments with the test DNA, thereby forming a four stranded 
DNA structure in the test DNA, which structure comprises the two strands offtxe 
test DNA to each of which is aimealed a probe oligonucleotide strand; and 

(d) contacting said DNA structure with immobilized MutS which binds to one or 
more base pair mismatches or unpaired bases present in said four stranded DNA 
structure, 

thereby detecting the mutation and/or SNP. 
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5. The method of any of claims 1-4 wherein the mutation being detected is a single 
nucleotide substitution or the addition or deletion of 1-4 nucleotides. 

6. The method of any of claims 1-4 wherein the test DNA molecule is selected from 
the group consisting of prokaryotic genomic DNA, eukaryotic genomic DNA, cDNA, viral DNA, 

5 plasmid DNA, and a DNA fragment amplified by PGR or by another amplification method. 

7. The method of any of claims 1-4 wherein the probe is selected from the group 
consisting of: 

(a) a synthetic oligonucleotide; 

(b) a recombinant oligonucleotide; 

10 (c) an oligonucleotide obtained by denaturing, and, optionally cleaving, a double 

stranded DNA molecule. 

8. The method of claim 7, wherein the oligonucleotide has a length of about 20 to 
about 60 nucleotides. 

9. The method of any of claims 1—4, wherein: 

15 (i) the probe and the MutS are labeled; ' 

(ii) the label is a fluorophore, a chromophore, a radionuclide, biotin or digoxigenin; 
and 

(iii) association of the probe label v^dth the MutS label is indicative of the presence of 
the mutation or the SNP in the test DNA. 

20 10, The method of any of claims 1-4, wherein the RecA protein is from E, colL 

1 1 . The method of any of claims 1-4, wherein 

(i) the RecA and MutS are labeled; 

(ii) the label is a fluorophore, a chromophore, a radionuclide, biotin or digoxigenin; 
and 

25 (iii) association of the RecA label with the MutS label is indicative of the presence of 

the mutation or the SNP in the test DNA. 

12. The method of claim 1 or 2 wherein the MutS is inmiobilized to a solid support. 

13. The method of claim 1 or 2 wherein the detectable MutS label is a fluorophore, a 
chromophore, a radionuclide, biotin, digoxigenin, a detectably labeled bead, a detectable labeled 
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anti-MutS antibody, or a combination of an unlabeled anti-MutS antibody and a detectably 
labeled secondary antibody specific for the anti-MutS antibody. 

14. The method of claim 1 or 2 wherein the RecA protein is labeled and the detection 
is of the MutS label associated with the RecA label present in the DNA D loop structures. 

5 15. The method of any of claims 1 - 4, wherein the detectable RecA label is in the 

form of a detectably labeled primary anti-RecA antibody, or a combination of an unlabeled anti- 
RecA antibody and a detectably labeled antibody specific for the anti-RecA antibody. 

16. The method of any of claims 1-4, wherein one or more of the detectably labeled 
probe, the detectably labeled RecA and/or the detectably labeled MutS is labeled with a 

1 0 fluorophore. 

1 7. The method of any of claims 1 -4, wherein the detecting is by flow cytometry. 

18. The method of claim 16 wherein the detecting is by flow cytometry. 

19. The method of claim 1 or 3, wherein the DNA D loop stmcture is stabilized by the 
addition, before step (d), of SSB protein which is optionally detectably labeled. 

15 20. Themethodof claim 19, whereiii, 

(i) the SSB protein is labeled with a detectable label; 

(ii) the label is a fluorophore, a chromophore, a radionuclide, biotm, digoxigenin, a 
labeled anti-SSB antibody, or a combination of an unlabeled anti-SSB antibody 
and a labeled secondary antibody specific for the anti-SSB antibody; and 

^0 ("0 association of the SSB label with the MutS label is indicative of the presence of 

the mutation or the SNP in the test DNA. 

21. The method of claim 1 or 3 wherein the detecting is by flow cytometry which 
detects the coincidence of two, three or four labels which are bound to: 

(a) ' MutS and the probe; 

25 (b) MutSandRecA; 

(c) MutS, RecA and the probe; 

(d) MutS and SSB; 

(e) MutS, SSB and the probe; or 

(£) MutS, SSB, the probe and RecA. 
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22. The method of any of claims 1-4 wherein the probe is labeled by polymerase 
extension using labeled deoxynucleotide triphosphates or nucleotide terminators. 

23. The method of claims 1 or 2, wherein the test DNA is immobilized to a solid 
support. 

24. The method of claim 1 or 2, wherein the probe is bonded to an adduct that allows 
immobilization of the probe following formation of said D-loop structure. 

25. The method of claim 24, wherein the adduct is an oligonucleotide. 

26. The method of claim 24, wherein the adduct is biotin or digoxigenin. 

27. A kit useful for detecting a one or more mutations or polymorphisms in a DNA 
sample, the kit being adapted to receive therein one or more containers, the kit comprising: 

(a) a first container containing a RecA protein which is optionally detectably labeled; 

(b) a second container containing MutS protein which is optionally detectably 
labeled; and 

(c) a third container or plurality of containers containing buffers and reagent or 
reagents capable of detecting boimd MutS. 

28. A kit useful for detecting a specific mutation or polymorphism or a specific group 
of mutations or polymorphisms in a DNA sample or for examining a specific region or regions of 
DNA for any mutations or polymorphisms, the kit being adapted to receive therein one or more 
containers, the kit comprising: 

(a) a first container containing RecA protein which is optionally detectably labeled; 

(b) a second container containing MutS protein which is optionally detectably 
labeled; 

(c) a third container or plurality of containers containing a specific oligonucleotide 
probe or probes, which probes are selected to be complementary to specific 
sequences in specific regions in the DNA of the sample and which form 
mismatch-containing or impaired base-containing heteroduplexes with a mutated 
or polymorphic sequence or sequences in the specific DNA regions, which probe 
or probes is or are optionally detectably labeled; and 

(d) A foiulh container or plurality of containers containing buffers and reagents 
capable of detecting MutS when it is boimd to specific heteroduplexes formed 
between the probes and the sample DNA. 
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